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\ . Introduction 

The National Assessment of Educational Progress (NAEP) currently 
tests seventeen year-old students enrolled in public and private 
secondary schools but it does no£ test "out-of -school" seventeen year- 
olds.^ "Out- of- school" refers to students not enrolled in elementary 
or secondary schools. The out-of -school seventeen-year-olds include 
both those seventeen year-olds who have dropped out of school as well 
as those who graduated from high school. As we estimate that 1 out of 
every 5 seventeen-year-olds is "out-of-school"^ ^ the omission of these 
persons from NAEP weakens the interpretability of NAEP findings. For 

^The author is Associate Professor, Departments of Statistics and 
Human Development and Social Policy and Center for Urban Affairs and 
Policy Research, Northwestern Un.Wersity, and Director, Methodology 
Research Center, NORC. 
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Here a seventeen-year-old is a person who had attained age 17 by 
October 1 of whatever academic year or calendar year is under 
cons iderat ion . 

This figure is derived as follows. The 1980 census counted a 
total of 77,000 seventeen-year-olds (as of 4/1/80) in institutions or 
group quart:ers (U.S. Bureau of Census, 1980 Census of Population . 
PC80-2-4D, Tables 2 and 44). The civilian noninstzitutional population 
of seventeen-year-olds (as of ? 0/1/80) numbered 4,097,000, of which 
3,393,000 vsrc?. "In school" (U.S. Bureau of Census, Current Population 
Repoyt;s P-20, Nc. 400, Table 41). Ignoring the minor diffe Tences in 
the definition of seventeen-year-old, and conservatively estimating 
that the group quarters and institutional seventeen-year- olds were not 
"in school", we estimate that about 3,393/4,174 or 81% were "in 
school". We assume that this proportion holds constant over time. 
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example, an improvement in the drop-out rate for students accompanied 
by no change in the performance levels of all seventeen-year-olds would 
cause a decline in NAEP scores for seventeen-year-olds.^ Here NAEP 
findings would appear to indicate a deterioration in the educational 
system, when in fact there was an improvement- -fewer students dropped 
out, and performance levels of students were unchanged. Or, for 
another exampl'.e, a finding that 90% of the tested seventeen-year- olds 
could read at a certain level could well mear> that fewer than 80% of 
all seventeen-year- olds could read at that level. ^ 

Generally, we are interested in the performance of all seventeen- 
year-olds, not the selected majority who are secondary school students. 
Of course, attention may iocus on all seventeen-year-olds who live in a 
particular state or who belong to a particular subgroup. When one con- 
siders assessing the performance of all seventeen-year-olds, it is 
natural to consider assessing the performance of other out-of -school 
populations as well. For example, what is the literacy level of adults 
aged 21-25 and is literacy improving over time? What knowledge (as 
opposed to attitudes) does the voting-a3e population possess of nuclear 
energy, the strategic strength of the nation, the proportion of the 
federal budget that goes to education or other social services or 
defense? The viability of a democracy must depend in part on the know- 
ledge of the voters, on an educated electorate. Assessment of adult 

^This decline ariso.s because those students ^;ho in past years 
would have dropped out of school but now remain isi school tend to score 
lower than the population of seventeen-year- olds as a whole. 

^The oretically, as few as 72% could read at that level in the 
(implausible) event that none of the out-of -school seventeen-year- olds 
could read at that level. 
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("out-of -school") populations is vital to ensuring that the voting 
population is sufficiently educated to carry out its responsibilities, 
yet distressingly little such assessment ir conducted. \Ttiile noting 
the importance of assessing parsons at older ages, we restrict consi- 
deration in this paper to (the out-of -school) seventeen-year- olds , 

2 . Considerations of Precision and Sanple Size 

Statistics produced by any data collection program are subject to 
various kinds of errors, and the goal of efficient survey design is to 
strike a favorable balance between the level of error anc" :he cost of 
gathering the data and producing the statistics. NAEP utilizes 
sampling, and an important source of error is sampling variability. To 
measure sampling variability we will use the notion of a standar d 
error, which may be simply interpreted as the typical size of the 
difference between a statistic calculated from a particular sample and 
what the statistic would have been if the entire population had been 
surveyed. Typically, the chances are about two out of three that: the 
difference (or error) is smaller in size than one standard er-^or, and 
they are about nineteen out of twenty (95%) that 'She difference (or 
error) is smaller than two standard errors. The coefficient of 
variation ( c .v. ) expresses the standard deviation as a percentage of 
the number we are trying to estimate. For example, if we esytimete the 
proportion of seventeen-year-olds who can read at a certain level to be 
40% and the standard error is 8%, then the c.v. is estimated -.o be 20% 
( - .08/. 40), 
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One can reduce the standard error by (1) increasing the sample 
size, (2) by keeping the sample size fixed and increasing the effi- 
ciency of the sample design (i.e., changing the way the sample is 
selected so that the precision is increased), or (3) by a combination 
of changing the sample size and changing the efficiency. Increasing 
the efficiency of a sample design may increase the cost, and for that 
reason the most cost-effective strategy may not be to use the most 
efficient design. To facilitate comparisons of alternative sample 
designs we will use the notion of effective sample size , which we will 
denote by n^ff. Those readers familiar with fhe concept of design 
effect or def f will note that the number of seventeen-year-olds in the 
sample, say n, is equal to the pvoduct of the design effect and the 
effective sample size, or n^^f « u/deff. 

How_large a sample of seventecn-ysar-olds is needed depends of 
course on what statistics ars of interest and how accurate they need to 
be. We will consider four types of statistics here- end we will 
indicate the sample sizes necessary to attain alternative target levels 
of precision in thci statistics. The statistics include comparisons of 
average scores for states, average scores for population subgroups, 
comparisons across states of proportions of seventeen-year-olds 
performing at a given level, and proportions of subpopulations 
performing at a given level. The scores will be assumed to be standard 
scores, i.e., scores expressed in standard units, with standard 
deviations equal to 1. 

The standard error of a comparison of the average standard score 
in one state with the average standard score in another state is 
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approximately equal to the square-root of the sura of the reciprocals of 
the effective sample sizes in the tvo states. Table 1 shows what 
sample sizes are needed to j^ield various standard errors when the same 
size sample is taken in eac'^ 'uate. 



TABLE 1 

Standard Error of Comparison of Average Standard Scores 
in Two States as a Function of Sample Sizes 

Effective Sample Size in Each State 



Standard Error 



1% 2% 2.5% 3%4i5%6%7%8%9%10% 

20,000 5.000 2.263 2,222 1,250 820 556 408 312 247 200 



Thus, an effective sample size of 312 in each state yields comparisons 
of state average standard scores with standard errors of 0.08. Table 1 
may also be used to analyze the sample sizes necessary for com- 
paring subgroups across states. Tor example, to compare the average 
standard score for males in state A with females in state B with a 
standard error of 10%, it is sufficient to have an effective sample 
size of 200 males from state A and 200 females from state B. 

Table 2 shows the sample sizes needed to attain various standard 
errors, not of comparisons across states, but simply of the average 
standard score within a state or other subpopulation. 
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TABLE 2 

Standard Error of Average Standard Score 
for a Subpopulation as a Function of Sample Size 

Effective Sample Size in Subpopu-lation 



Standard Error 



1% 2% 2.5% 3%iL%5%6%7%8l9%10i 
10,000 2,500 1,600 1,111 625 400. 278 204 156 124 100 



Sometimes, we are interested in comparisons across states of the 
proportions of students who can perform at certain levels (e.g.., answer 
a given item correctly) . The precision of such a comparison depends on 
the actual proportions in each state as well as on the sample sizes. 
Table 3 shows how the standard error and coefficient of variation of 
such a comparison vary with the underlying proportions and the 
effective sample sizes. For example, if the underlying proportions are 
each near 25% and an effective sample size of 400 is taken from each 
state, the comparison of the estimated proportions will have a standard 
error of about 0.03 and a coefficient of variation of about 12%. 
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TABLE 3 



Standard Errors and Coefficients of Variation of Comparisons" 
of Proportions for Two States as Functions of Sample Sizes 

Standard Errors (CV.'s) of Comparisons 



Actual Proportions in Each State 

Effective 
Sample Size 



in Each State 


.10 


■ 25 


■ 50 


■ 75 


.90 


100 


.042 


.061 


.071 


061 






(42%) 


(24%) 


(14%) 


(8%) 


(5%) 


124 


.038 


.055 


.063 


.055 


. 038 




(3C%) 


(22%) 


(13%) 


(7%) 


(4%) 


156 


.034 


.049 


.057 


.049 


. 034 




(34%) 


(20%) 


(11%) 


(7%) 


(4%) 


204 


.030 


.043 


.049 


.043 


. 030 




(30%) 


(17%) 


(10%) 


(6%) 


(3%) 


278 


.025 


.037 


.042 


.037 


025 




(25%) 


(15%) 


(8%) 


(5%) 


(3%) 




AO! 

. 021 


. 031 


.035 


.031 


.021 




(21%) 


(12%) 


(7%) 


(4%) 


(2%) 


625 


.017 


.025 


.028 


.025 


.017 




(17%) 


(10%) 


(6%) 


(3%) 


(2%) 


1,111 


.013 


.018 


.021 


.018 


.013 




(13%) 


(7%) 


(A%) 


(2%) 


(1%) 


1,600 


.011 


.015 


.018 


.015 


.011 




(11%) 


(6%) 


(4%) 


(2%) 


(1%) 


2,500 


.008 


.012 


.014 


.012 


.008 




(8%) 


(5%) 


(3%) 


(2%) 


(1%) 


4,444 


.006 


.009 


.01.1 


.009 


.006 




(6%) 


(4%) 




(1%) 


(1%) 


10,000 


.004 


.006 


.007 


.006 


.004 




(4%) 


(2%) 


(1%) 


(1%) 


(Z) 



Z • less than 0.5% 



The next table shows, the sample sizes needed to attain various 
levels of precision for an estimated proportion for a subpopulatior by 
itself (i.e., not as part of comparison). 

TABLE 4 

Standard Errors and Coefficients of Variation of Proportions 
as Functions of Sample Sizes 

Standard Errors (C.V.'s) of Proportions 



Effective Sample 




Actual Proportion in 


ouopopuiat ion 




Size in Sub- 


J 10 








Population 


■ 50 


7 ^ 




100 


. 030 


.043 


.050 


. J) 


no c 




(30%) 


(17%) 


(10%) 


(6%) 


(3%) 


124 


.027 


. 039 


.045 








(27%) 


(16%) 


(9%) 


(5%) 


(3%) 


156 


.024 


. 035 


.040 


. \J ^ D 


nil 
. u / / 




(24%) 


(14%) 


(8%) 


(5%) 


(3%) 


204 


.021 


. 030 


.035 




HA A 
. UO 0 




(21%) 


(12%) 


(7%) 


(4%) 


(2%) 


278 


.018 


. 026 


.030 


. w ^ u 






(18%) 


XVt ) 


(6%) 


(3%) 


(2%) 




. 013 


.022 


.025 


.022 


.015 




(15%) 


(9%) 


(5%) 


(3%) 


(2%) 


625 


.012 


.017 


.020 


.017 


.012 




(12%) 


(7%) 


(4%) 


(2%) 


(1%) 


1.111 


.009 


.013 


.015 


.013 


.009 




(9%) 


. (5%) 


(3%) 


(2%) 


(1%) 


1,600 


.008 


.011 


.012 


.011 


.008 




(8%) 


(4%) 


(2%) 


(1%) 


(1%) 


2,500 


.006 


.009 


.010 


.009 


.006 




(6%) 


(3%) 


(2%) 


(1%) • 


(1%) 


4,444 


.004 


.006 


.008 


.006 


.004 




(4%) 


(3%) 


(2%) 


(1%) 


(Z) 


10,000 


.003 


.004 . 


.005 


.004 


.003 




(3%) 


(2%) • 


(1%) 


(1%) 


(Z) 



Z - less than or equal to 0.5% 
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Tables 1 and 3 show that accurate sfate-level comparisons will not 
generally be possible unless the overall sample size is quite large. 
For example, to compare proportions in two states so that the standard 
error of the estimated difference is sure to be less than about .02 
requires an effective sample size of more than 1,111 per state (from 
Table 3, line 15 » column 3). If such accuracy were needed for all 
states, the total effective sample size would need to exceed 50,000 
seventeen-year-olds . 

Reasonable levels of precision in statistics for larger subgroups 
are attainable at more modest sample sizes. To estimate the average 
standard score for a subpopulation with a standard error of 0.025 
requires an effective sample size of 1600 for the subpopulation. If 
the subgroup is as large or larger than about one -sixteenth of the 
cohort of seventeen-year. olds (e.g., the subpopulation of black males 
is about this large) and the sampling rate for that subgroup is the 
same as for the cohort as a whole, then a total effective sample size 
of 25.600 is needed. 

The remainder of this paper will assume that the sample will have 
an effective size of about 26,000. This assumption implies that the 
actual number of students responding to the NAEP questionnaires must 
well exceed 26,000, due to the inevitable complexity of the sampling 
design to be used. The complexities include clustering of students in 
schools, unequal selection probabilities for students in different 
schools, unequal sampling rates for in- school and out- of -school seven- 
teen-year- olds , and other factors. For the present purposes, we may 
approximate the relationship between the needed number of students to 
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be assessed, and the effective sample size, n^ff, by the formula 

^ " ^clus * ddisp ■ n^ff 
where d^i^s represents the design effect due to clustering of students 
in schools and d^^j^^p represents the design effect due to dispropor- 
tionate sampling of in-school and out-of -school seventeen-year-olds. 
Past experience suggests that d^i^s ^^11 about 1.5 or so. We may 
approximate d^^gp by the formula 

ddisp - -OVf + .64/(l-f) 
with f - the fraction of the sf'jnple allocated to the out- of -school 
seventeen-year-olds. Determination of the optimal allocation of the 
sample to in-school versus out-of -school seventeen-year-olds is beyond 
the scope of this paper. 

Table 5 shows some alternative sample allocations yielding effec- 
tive sample sizes of 26,000. 

TABLE 5 

Illustrative Sample Allocation and Effective Sample Sizes 
(see text for assumptions) 



in-school sample out-of -school 

respondents sample respondents 

41,909 - . 3,592 

35,036 5,005 
32,500 6,500 



total sample effective 

respondents sample size 

45,501 26,000 
40,041 26,000 
39,000 26,000 



Note that the last allocation in Table 5 --with a total of 39,000 
respondents- -is not necessarily less costly than the others. The cost- 
effectiveness of the three allocations depends on the relative costs of 
assessing in-school versus out-of -school seventeen-year-olds. 
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Recently NAEP has assessed on the order of 30,000 seventeen-year- 
olds in a given year. In light of this fact, it is plausible that the 
desired number of out-of -school seventeen-year-old participants in NAEP 
should be around 5,000 or 6,000. 

3 . Sampling Strategies 

^The key to efficiently sampling the out-of -school seventeen-year- 
olds is to be able to identify them reliably and inexpensively. We 
will consider two kinds of sampling strategies. One strategy is based 
on sampling households (and other places of residence) . The other ^ 
strategy is based on sampling seventeen-year-olds at younger ages when 
they are almost all in school. 

First we consider household surveys. The U.S. population resides 
in approximately 85 million households.^ The Bureau of the Census 
reports that the households in 1980 contained about 4.1 million seven- 
teen-year-olds; due to declining birth rates in the 1960's, in 1990 the 
households will contain only about 3.5 million seventeen-year- olds . 
Making the simple but reasonable approximation that each household 
contains no more than one seventeen-year-old, we may estimate that 
somewhat fewer than 1 in 25 households will contain a seventeen-year- 
old. As only 1 out of 5 seventeen-year-olds is out-of -school , somewhat 
fewer than 1 in 125 households will contain an out-of -school seventeen- 
year-old. 

These figures--l in 25 households containing a seventeen-year-old 
-;nd 1 in 125 households containing an out-of -school seventeen-year-old- 

^The current number of households is slightly less, but in a few 
years the number will be about 85 million, or perhaps even 90 million. 
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- imply that a large number of households will need to be screened to 
yield a sample of the desired size. 

A sample of 100,000 households can be expected to yield 4,000 
seventeen-year-olds. Even if areas with high proportions of out-of- 
school seventeen-year-olds could somehow be efficiently^ identified and 
oversampled, it is doubtful that more than 1,000 - 2,000 out-of -school 
seventeen-year-olds could be sampled in this way. The screening costs 
alone for a 100,000 household survey would likely exceed 4 million 
dollars. ,Thus , conducting a special purpose household survey to sample 
out-of -school seventeen-year-olds will be very expensive and not very 
fruitful. 

An attractive alternative is to capitalize on existing large 
household surveys which could perform the screening at minimal cost. 
The Current Population Survey (CPS), fielded by the Bureau of the 
Census, is a monthly survey of some 60,000 households. Each successive 
month some households are retained from previous months and other, new, 
households are selected, so that over the course of a year some 200,000 
households are surveyed. By spreading out the screening for the out- 
of-school seventeen-year-olds for a twelve month period, it should be 
possible to sample some 2,000 out-of -school seventeen-year-olds. Not 
all of the out-of -school seventeen-year-olds will be residents of 
households, but it is possible to unambiguously link each non- 
household- resident seventeen-year-old to a residence; for example, a 
college student living in a dormitory would be eligible for selection 

^It is necessary that all of the out-of -school seventeen-year-olds 
have approximately equal selection probabilities, or else the effective 
s^pl-e size will decrease. 
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if the student's parents' STET (or step parent's, or legal guardian's, 
etc.) household were sampled. ^ In this way, a sample of approximately 
2,000 out-of-school seventeen-year-olds will be attainable. 

It is clear that 2,000 is too few out-of -school seventeen-year- 
olds to support an effective overall sample size of 26,000 unless some 
115,000 in-school seventeen-year-olds are sampled. Either this sample 
of 2,000 needs to be augmented, or we will have to live with a smaller 
effective sample size. For example, a combination of 2,000 out-of- 
school plus 33,000 in-school seventeen-year-olds yields an effective 
sample size of almost 17,000. 

We now turn to consider a second sampling strategy, which may be 
used either by itself or in combination with the CPS-based sample 
described above. The basis for this second strategy is the fact that 
almost all children in the U.S. aged 14 and below are enrolled in 
school at ages below 15. Thus, one way to obtain a sample of 
seventeen-year-olds is to sample thirteen-year-olds in a school-based" 
sample, keep track of the sampled students, and then recontact them 
four years later when they are aged 17 and include them in NAEP. 

To better describe the details of the strategy, we will first 
consider its hypothetical application to NELS:88, the National Educa- 
tional Longitudinal Study of the eighth grade class of 1988. In 1988 
(the base year) NELS:88 will sample approximately 30,000 eighth grade 
students from about 1000 public and private schools containing eighth 



^An application of this technique is described in M.R. Frankel, 
H.A. McWilliams, and B.D. Spencer, National Longitudinal Survey of 
Labor Fo rce Behavior .JYouth Survey (NLS). Technical Sampling Report 
NORC, August 1983. 
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graders. As 68% of the students in the eighth grade class of Fall 1981 
were age 13^, we may expect the NELS:88 base-year sample to yield more 
than 20,000 thirteen-year-olds, representing 65% - 70% of the cohort. 
NELS:88 will follow these students (or a subsample of them) and 
interview them" at age 15 in 1990 and at age 17 in 1992. The seventeen- 
year- olds in 1992 would need to respond to the NAEP questionnaire as 
well. In this way, all but between 30% and 35% of the seventeen-year - 
old cohort may be sampled. 

To see how to sample the remaining 30% to 35% of the seventeen- 
year- olds, consider the breakdown of seventeen-year- olds by status at 
age 13:^0 



enrolled in grade 8 68% 

enrolled in grade 7 18% 

enrolled in grades 1-6 3% 

enrolled in grades 9-12 8% 

in U.S. but not in school 1% 

not in U.S. 2% 

all seventeen-year-olds by 

status four years earlier 100% 



Nearly all of the 18% of the thirteen-year-olds who are enrolled in 
grade 7 attend schools eligible for inclusion in the NELS:88 sample, 
and they could easily be screened and included in the NELS:88 sample at 
slight extra cost beyond the currently planned NELS:88 surveying. Some 
portion of the 3% of the thirteen-year-olds enrolled in grades 1-6 and 
some slight fraction of the 8% enrolled in grades 9-12 also attend 



^Enrollment figures based on U.S. Bureau of Census, Current 
Population Reports . Series P-20 No. 400, Table 15; data for persons not 
in U.S. based on U.S. Bureau of the Census, 1980 Census of Population . 
PC 80-2-2A, Table 1. 

^^Same as above. 

^' 16 
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schools eligible for inclusion in the NELS:88 sample, hence they too 
are easily sampled by NELS:88. However, the remainder of the thirteen- 
year-olds enrolled in grades 1-6 and 9-12 do not attend schools that 
NELS:88 will sample, and so an extra sample of schools would be 
necessary to sample these students. Assuming these, students comprise 
about 9% of the thirteen-year-olds , an extra 70-80 schools would 
suffice to yield samples of these students with similar sampling 
weights and clustering characteristics to the main NELS:88 sample of 
thirteen-year-olds; of course, alternative sampling allocations could 
also be used. Thus, by slightly expanding NELS:88 97% of the seventeen 
cohort may be sampled. 

The remaining 3% of the cohort is comprised of the 2% who immi- 
grate after age 13 and the 1% who are not enrolled in school at age 13. 
Those immigrants who enter the school system after age 13 may be 
screened for by NELS:88 or NAEP during later school surveying. Those 
immigrants who do not enter the school system are problematic to 
sample. Similar comments apply to the 1% not enrolled in school at age 
13; if they enter the school system later they may be screened for in 
school surveying operations, and otherwise they are problematic. I do 
not have good suggestions at this time for these difficult- to- sample 
groups; perhaps there is some comfort in the fact that they comprise 
perhaps 2% or less of the cohort. On the other hand, the CPS is prone 
to possibly even larger under-coverage of the out-of -school seventeen- 
year- olds. 

This second strategy, of linking the seventeen-year-old sampling 
to NELS:88 (or other longitudinal survey of younger students) has the 

, 17 
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potential of yielding substantial numbers of out-of -school seventeen- 
year-olds. For example, the sampling strategy for NELS:88 rould yield 
30,000 seventeen-year-olds, of which 6,000 would be expected to be out- 
of -school. (The other 24,000 would not receive NAEP questionnaires, 
except through chance overlap of the NELS:88 and NAEP school samples.) 

An a^'.temative to linking the sample of out-of -school seventeen- 
year-olds to a longitudinal survey is to link it to a cross-sect:lonal 
survey, such as NAEP samples of schools curtaining fifteen-year-olds. 
In particular, NAEP could select a sample of fifteen-year- olds from the 
sampled schools and follow up the students two years later when th^j 
are aged seventeen. 

To see how this strategy would work, consider the breakdown of 
seventeen-year-olds by status at age 15^^: 



Z - less than 0.5% 



Assuming that NAEP will sample at least approximately 325 schools 
containing eleventh grade students and 475 schools containing eighth 
grade students, there shou?.d*be no particular difficulty in obtaining a 



-^Enrollment figures based on U.S. Bureau of the Census, Current 
Population Reports . Series P-20 No. 400, Table 15; data for persons not 
in U.S. based on U.S. Bureau of the Census, 1980 Census of Population . 
PC80-2-2A, Table 1, 



enrolled in grade 10 
enrolled in grade 9 
enrolled in grades 7-8 
enrolled in grades 1-6 
enrolled in grades 11-12 
in U.S. but not in school 
not in U.S, 

all seventeen-year-olds by status 
two year earlier 



100% 



62% 
22% 
3% 
Z 
8% 
3% 
2% 
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sample of more than 30,000 f if teex.-year-olds enrolled in grades 7-10. 
A very small proportion of fifteen-year-olds are enrolled in gradesj 1- 
6, and these could easily be screened for as part of the NAEP sampling 
of schools containing fourth grades. The remaiiiing 5% of the cohort is 
comprised- of the 2% who immigrate after age 15 and the 3% who are not 
in- school at age 15. These individuals will be somewhat difficult to 
handle, but we may attempt to sample them in the manner described 
earlier for the NELS : 88-linked sample. 

The difficulty of sampling the immigrants and t:he out-of -school 
fifteen-year-olds izi more problematic for the NAEP-linked strategy than 
for the NELS : 38 -linked strategy, because the numbers of these students 
are larger (5% of the cohort versus 3%) for the former than the latter. 
In principle, one could apply the NAEP-linked strategy to fourteen- 
year-olds rather than to fifteen-year-olds, but then the seventeen- 
year-old sample would have to occur three years after NAEP, rather than 
tvjo years after. 

If one can wait until 1992 for the out-of -school seventeen-year- 
old sample then the NELS : 88- linked strategy will be more cost-effective 
than NAEP-linked sampling of thirteen-year-olds because NELS: 88 will be 
following the majority of these students anyway, so data collection 
costs will be lower than for the NAEP-linked strategy. The NAEP-linked 
str:..tegy has the potential to yield 6,000 out-of -school seventeen-year- 
olds before 1992, so if a sample of but-of -school seventeen-year-olds 
in 1989, 1990, or 1991 is needed then a NAEP-linked strategy is 
recommended . 
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Conclusions 

Sfimpliiig out-of -school seventeen-year- olds will be best accom- 
plishf/d through cooperation with other ongoing survey efforts. The 
Current Population Survey has the potential capability to yield a 
sample of up to approximately 2,000 out-of -school seventeen-year-olds. 
NELS:88, suitably expanded, has the capability of producing a sample of 
6,000 out-of -school' seventeen-year-olds in 1992. By screening for 
fourteen-year-olds or fifteen-year-olds as part of ongoing NAEP surveys 
it is possible to yield a sample of 6,000 out-of -school seventeen-year- 
olds prior to 1992, perhaps as early as 198S. The needed sample size 
depends on the uses to be made of the data, but it is probably closer 
to 6,000 than to 2,000. 

The NAEP-linked strategy will be more expensive than the NELS:88- 
linked strategy, so if 1992 is a satisfactory time for the assessment 
of out-of -school seventeen-year-olds, the NELS : 88 - linked strategy is 
preferable. If the assessment is needed sooner, the NAEP-linked 
strategy is preferable. 
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